Performance of analytical methods for overdispersed counts in cluster randomized trials: sample size, degree of clustering and imbalance.

نویسندگان

  • Gonzalo Durán Pacheco
  • Jan Hattendorf
  • John M Colford
  • Daniel Mäusezahl
  • Thomas Smith
چکیده

Many different methods have been proposed for the analysis of cluster randomized trials (CRTs) over the last 30 years. However, the evaluation of methods on overdispersed count data has been based mostly on the comparison of results using empiric data; i.e. when the true model parameters are not known. In this study, we assess via simulation the performance of five methods for the analysis of counts in situations similar to real community-intervention trials. We used the negative binomial distribution to simulate overdispersed counts of CRTs with two study arms, allowing the period of time under observation to vary among individuals. We assessed different sample sizes, degrees of clustering and degrees of cluster-size imbalance. The compared methods are: (i) the two-sample t-test of cluster-level rates, (ii) generalized estimating equations (GEE) with empirical covariance estimators, (iii) GEE with model-based covariance estimators, (iv) generalized linear mixed models (GLMM) and (v) Bayesian hierarchical models (Bayes-HM). Variation in sample size and clustering led to differences between the methods in terms of coverage, significance, power and random-effects estimation. GLMM and Bayes-HM performed better in general with Bayes-HM producing less dispersed results for random-effects estimates although upward biased when clustering was low. GEE showed higher power but anticonservative coverage and elevated type I error rates. Imbalance affected the overall performance of the cluster-level t-test and the GEE's coverage in small samples. Important effects arising from accounting for overdispersion are illustrated through the analysis of a community-intervention trial on Solar Water Disinfection in rural Bolivia.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Important considerations in calculating and reporting of sample size in randomized controlled trials

Background: The calculation of the sample size is one of the most important steps in designing a randomized controlled trial. The purpose of this study is drawing the attention of researchers to the importance of calculating and reporting the sample size in randomized controlled trials.    Methods: We reviewed related literature and guidelines and discussed some important issues in s...

متن کامل

How large are the consequences of covariate imbalance in cluster randomized trials: a simulation study with a continuous outcome and a binary covariate at the cluster level

BACKGROUND The number of clusters in a cluster randomized trial is often low. It is therefore likely random assignment of clusters to treatment conditions results in covariate imbalance. There are no studies that quantify the consequences of covariate imbalance in cluster randomized trials on parameter and standard error bias and on power to detect treatment effects. METHODS The consequences ...

متن کامل

Design of cluster-randomized trials of quality improvement interventions aimed at medical care providers.

BACKGROUND Randomized trials aimed at improving the quality of medical care often randomize the provider. Such trials are frequently embedded in health care systems with available automated records, which can be used to enhance the design of the trial. METHODS We consider how available information from automated records can address each of the following concerns in the design of a trial: whet...

متن کامل

Planning a cluster randomized trial with unequal cluster sizes: practical issues involving continuous outcomes

BACKGROUND Cluster randomization design is increasingly used for the evaluation of health-care, screening or educational interventions. At the planning stage, sample size calculations usually consider an average cluster size without taking into account any potential imbalance in cluster size. However, there may exist high discrepancies in cluster sizes. METHODS We performed simulations to stu...

متن کامل

A Multi-Objective Approach to Fuzzy Clustering using ITLBO Algorithm

Data clustering is one of the most important areas of research in data mining and knowledge discovery. Recent research in this area has shown that the best clustering results can be achieved using multi-objective methods. In other words, assuming more than one criterion as objective functions for clustering data can measurably increase the quality of clustering. In this study, a model with two ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Statistics in medicine

دوره 28 24  شماره 

صفحات  -

تاریخ انتشار 2009